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ABSTRACT 

We present the results of a MOPED analysis of ~ 3 x 10^ galaxy spectra from the Sloan 
Digital Sky Survey Data Release Three (SDSS DR3), with a number of improvements in 
data, modelling and analysis compared with our previous analysis of DR 1 . The improvements 
include: modelling the galaxies with theoretical models at a higher spectral resolution of 3A; 
better calibrated data; an extended list of excluded emission lines, and a wider range of dust 
models. We present new estimates of the cosmic star formation rate, the evolution of stellar 
mass density and the stellar mass function from the fossil record. In contrast to our earlier 
work the results show no conclusive peak in the star formation rate out to a redshift around 2 
but continue to show conclusive evidence for 'downsizing' in the SDSS fossil record. The star 
formation history is now in good agreement with more traditional instantaneous measures. 
The galaxy stellar mass function is determined over five decades of mass, and an updated 
estimate of the current stellar mass density is presented. We also investigate the systematic 
effects of changes in the stellar population modelling, the spectral resolution, dust modelling, 
sky lines, spectral resolution and the change of data set. We find that the main changes in the 
results are due to the improvements in the calibration of the SDSS data, changes in the initial 
mass function and the theoretical models used. 

Key words: galaxies: fundamental parameters, galaxies: statistics, galaxies: stellar content 



1 INTRODUCTION 

The quality of spectra of the observed light of unresolved stellar 
populations has reached enough accuracy that it is possible to make 
detailed studies of the physical properties of the stellar popula- 
tions in these galaxies. An excellent example of this new generation 
of da t a-sets is given by the Sloan Digital Sky Survey jGunn et al.l 
ll998l : lYork et alj2000l ; IStrauss et alj20o3) at low redshift, not only 
by the size of the spectroscopic sample (about 10® spectra) but by 
the quality and wavelength coverage of the spectra. At higher red- 
shift the DEEP2 survey (Davis et al. 2003) is providing a similar 
database, albeit with a smaller wavelength coverage. Future spec- 
troscopic surveys (e.g. WFMOS) will yield larger samples at even 
deeper redshifts. Given the quality of the spectra, it is interesting 
to ask the question of whether reliable information about the stellar 
population of these galaxies can be inferred from the spectra. 

Indeed, several attempts have been made previously to 
study in detail the physical properties of the S DSS galaxies ei- 
ther by using selected features in the spectra |Kauffmann et al.l 



full s pectrum jPanter. Heavens & Jimenez! [20031; iHeavens et al.l 
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interesting conclusions about the physical pro perties of these galax- 
ies. In particular analysis of the SDSS sample ^Heavens et alj2004h 
and other local galaxies jThomas et al.ll20()3) show very clear evi- 
dence for 'downsizing' - the process by which star formation at low 
redshift takes place predominantly in low-mass galaxies, whereas 
more massive galaxies have the bulk of their star-formation activity 
at high redshift. also observed using other methods. In addition one 
can also determine the global star formation history in the Universe 
from this fossil record. Our previous study found broad agreement 
with observations of contemporary star formation at large lookback 
times, but suggested that there was a peak in star formation activity 
at 2 < 1. Agreement between the fossil record and contemporary 
star formation indicators is expected if the Cosmological Principle 
holds; this expected agreement offers an opportunity in principle to 
test the assumptions in the modelling of both the fossil record and 
the instantaneous star formation rates. Of most importance is that 
both are dependent on the stellar initial mass function, but in differ- 
ent ways, with the instantaneous rates being determined very much 
by the upper end of the IMF, while the fossil record is determined 
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by a wider range of stars. This offers an opportunity in principle to 
test the evolution of the IMF as a function of cosmic time. 

In our previous work, we have presented results from around 
10^ galaxies from the SDSS DRl. This paper enhances the pre- 
vious results through the introduction of a number of improve- 
ments to data and analysis methods, and through an investigation 
of systematic effects and sensitivity to assumptions. On the data 
side, we have analysed ~ 3 x 10^ galaxies from the SDSS DR3 
sample. The extra size of this sample is not particularly important, 
but the calibration of the data has been improved since DRl. On 
the methods side, we no longer rebin the data to 20A resolution, 
but compare the data with a wider range of theoretical models, in- 
cluding the Bruzual & Chariot models ( Bruzual & Charlot|[2003h 
at 3A resolution. We consider two stellar initial mass functions (a 
Salpeter and a Chabrier IMF), and a wider range of dust models. 
We also make some improvements to the treatment of emission 
lines, by removing additional weaker lines, since interstellar emis- 
sion lines are not included in the stellar modelling. Finally, we have 
also exp lored the effect of rem oval of sky lines, using a PCA-based 
method l lWild & Hewet^l2005l) . These studies give us a reasonable 
estimate of the sensitivity of the results to the assumptions. As is 
expected from a sample of this size, changes in the assumptions 
give rise to much larger variations in the results than the statisti- 
cal errors. The uncertainties in star formation rates from the stellar 
models used are typically about a factor of two, which makes it 
difficult to distinguish between different IMFs, although it may be 
possible to constrain extreme IMF variations. 

The layout of the paper is as follows. In Section 2 we outline 
the enhancements to the method, and the improvements to the data, 
and describe the assumptions which are made for analysis of the 
DR3 sample. In Section 3 we present the new results on star for- 
mation history, downsizing and galaxy stellar mass function from 
the DR3 dataset. We also introduce a method to assess the partic- 
ular areas where current models are lacking. In Section 4 we show 
the sensitivity of our results to changes in the assumptions, through 
analysis of a subset of the data, and in Section 5 we draw conclu- 
sions. 

Throughout this work we assume a concordance cos- 
mology with n„ = 73, p .m = 0.27, Ho = 71 
kms'^Mpc-^ jSpergel et a"i]|2003h . 



2 SDSS DR3 ANALYSIS 

In this Section we describe briefly the SDSS DR3 dataset, and 
outline the assumptions in the m ethod, highlighting c hanges made 
since the analysis of SDSS DRl jHeavens et al]|2004h . 



2.1 SDSS DR3 data 

The spectrophotometric pipeline used by the SDSS has evolved 
from the Early Data Release (EDR) to the DR3 samp le used here. 
According to the papers describing eac h release jstoughton et al.l 
l2002l : lAbazaiian et alj20oil2004l2005l) , the pipeline was changed 
between the EDR, DRl and DR2 subsets, but remained static be- 
tween DR2 and DR3. Some DRl spectra had a slight systematic 
offset for wavelengths smaller than 4000A. Abazajian et al. ( 2005) 
claim that this has been corrected in the DR3. There are no pub- 
lished plans for further improvements. The DRl data has been re- 
reduced with the new pipeline and this work considers the set of 
galaxies contained in the SDSS Main Galaxy Sample (MGS) of 



Table 1. Regions masked from MOPED fitting. 



Wavelength Range (A) 


Reason 


3711-3741 


[Oil] 


4087-4117 


[SII], US 


4325-4355 


H7 


4846-4876 




4992-5022 


[OIII] 


4944-4974 


[OIII] 


5870-5900 


Na 


6535-6565 


[NH] 


6548-6578 


Ho 


6569-6599 


[Nil] 


6702-6732 


[SII] 


6716-6746 


[SII] 



DRl -3 reduced with the DR2 version of the spectrophotometric 
pipeline. 

We apply further cuts to this main galaxy sample based on 
those of Shen et al. (2003). Our sample is determined by r band ap- 
parent magnitude limits of 15.0 ^5 nir ^ 17.77. The magnitude 
limit s are set by the SD SS target selection criteria, as discussed in 
Abaz ajian et al. I ( l2005h . The target criteria for surface brightness 
was fir < 24.5, although for /ir > 23.0 galaxies are included only 
in certain atmospheric conditions. In order to remove any bias and 
simplify our Vmax criteria we have cut our sample at fir < 23.0. 
At low redshifts a small number of Sloan galaxies are subject to 
shredding - where a nearby large galaxy is split by the target selec- 
tion algorithm into several smaller sources. To eliminate this effect, 
for our star formation analysis we use a range of 0.01 < z < 0.25. 
This also removes the problems of non Hubble-flow peculiar veloc- 
ities giving erroneous distances based on redshift, which can have 
a significant effect on recovered stellar mass. For samples involv- 
ing a very large redshift range there is a concern that after Vmax 
weighting an individual galaxy at low redshift can dominate higher 
redshift galaxy signals. For our criteria we have tested this, and no 
Vmax weighted galaxy contributes more than a tenth of a percent 
to the final mass total in each bin. The total number of galaxies in 
the DR3 Main Galaxy Catalogue is 312415, while the number that 
satisfy our cuts is 299571. In order to estimate the completeness of 
the survey we have used the ratio of target galaxies to those which 
have observed redshifts (R Nordberg, Priv. Comm.). This does not 
allow for galaxies which are too close for the targetting algorithm, 
and we estimate thi s fraction at a 6% level from the discussion in 
IStrauss et alj ( l2002h . As a result of both these cuts, our effective 
sky coverage is 2947 square degrees. 

We also remove from our analysis a larger set of wavelengths 
which may be affected by emission lines than in our DRl analy- 
sis. These lines are not modelled by the stellar population codes, 
which only consider the continuum and absorption features. The 
excluded restframe wavelength ranges due to emission or emission 
line filling of features are listed in table[T] We also discount signal 
with wavelength above 7800A in order to reduce the risk of skyline 
contamination. Typically the rest frame wavelength range used for 
analysis is 3450A - 7800A. 
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2.2 Modelling assumptions 

2.2.1 Stellar population modelling 

Both our EDR jPanter. Heavens & JimenezI [20031) a nd DRl 
jHeavens et alj |2004| : Panter. Heavens. & Jimenej |2004|) studie s 
used the 20 A resolution models described in Jimenez et alj j2004h . 
The field of stellar population modelling has moved on in the mean- 
time, and models at 3 A resolution an d better are available from 
various authors. We have used those of ISruzual & Chariot! i2003h 
as the basis for this study. 

Although with 20A models the effect of velocity dispersion 
can be ignored, there is possibility of significant changes in the 
input spectrum when working at 3A. After extensive testing we 
found that in fact there is very little effect, if any, on the recovered 
stellar populations and metallicities for a wide range of galaxies. 
We chose to apply a uniform velocity dispersion of 170 kms~^ 
to the 3A models, reflecting a typical value for the Main Galaxy 
Sample. 



2.2.2 Initial Mass Functions 

Both the instantaneous and fossil approach to star formation de- 
termination require assumptions about the Initial Mass Function 
(IMF). The two methods probe different mass regions and deduce 
the complete stellar populations by assuming an IMF. For instanta- 
neous measures such as H-q or Oil emission the presence of lower 
mass stars is estimated by working down the mass function from the 
massive stars which cause the majority of the emission. For high- 
redshift star formation estimated using the fossil record technique, 
the contribution to the spectrum from older, less massive stars is 
used to determine early star formation, requiring extrapolation up 
the mass function. 

In recent years, the choice of IMF has been the subject of 
much debate - in particular whether a universal IMF can be as- 
sumed, both in space and time. Several candidates have been pro- 
posed for such a IMF, however single stellar population (SSP) mod- 
els only include a few. Although disfavoured by observations, the 
Salpeter IMF ( Salpeter..l955.) has been used as a reference due to 
its simplicity; it i s a power law. The most recent modification is 
the Chabrier IMF iChabriej|200 3^ which seems to be very success- 
ful at reproducing current observations in our galaxy. For the main 
analysis, we use the Chabrier IMF. 



2.2.3 Dust 

Our previous MOPED studies used a single foreground dust screen. 
In this parameterisation the strength of the extinction may be char- 
acterised by E[B — V), and the wavelength dependence of the 
extinction is determined by the choice of extinction model, which 
may be empirical or modelled. We used an LMC extinction law for 
the main analysis, but later we will explore the difference in recov- 
ered SFH using the Calzetti ( 1997) starburst model and the LMC 
and SMC curves given in Gordon et al]l l2003h . 

We have also computed re sults using the two-dust parameter 
model of lCharlot&Falll feOOOl) . This is a more physically moti- 
vated model of the absorption of starlight by dust, which accounts 
for the different attenuation affecting young (< 10 Myr) and old 
stars in galaxies, as characterized by the typical absorption optical 
depths of dust in giant molecular clouds and in the diffuse ISM. 
Unfortunately, our investigations have shown that the absorption in 
giant molecular clouds cannot be well constrained from the optical 



continuum emission alone. This dust component would be more 
tightly constrained by the ultraviolet and infrared continuum emis- 
sion and by emission-line fluxes. We therefore do not show results 
based on this model here. 

2.3 Star formation and metallicity history parametrization 

In the past, the SFH of galaxies was typically modelled by an ex- 
ponential decay with a single parameter - for more complex mod- 
els one or two bursts of formation were allowed. In fact, it would 
be better not to put any such constraints on star formation, particu- 
larly considering that each galaxy may have (as a result of mergers) 
several distinctly different aged populations. Star formation takes 
place in giant molecular clouds, which have a lifetime of around 
10^ years. Splitting the history of the Universe into the lifetimes 
of these clouds give a natural unit of time for star formation anal- 
ysis, but unfortunately it would require several thousand of these 
units to map the age of galaxies formed 13 billion years ago, and 
the (lack of) sensitivity of the final spectrum to the detailed his- 
tory would make any estimate of star formation history extremely 
degenerate. We choose a compromise solution, where we allow 1 1 
time bins in which the star formation rate (SFR) can vary indepen- 
dently. This allows a reasonable time resolution, whilst not being 
prohibitively slow to compute. For most galaxies this parametriza- 
tion is a little too ambitious, so we do not recommend the use 
of the recovered star for mation histories on an individual galaxy 
basis. Extensive testing jPanter. Heavens & Jimene z 2003) shows 
that for large samples the average star formation history is recov- 
ered with good accuracy. Future work will concentrate on recov- 
ering only as much detail as the data from an individual galaxy 
demands. The boundaries between the II different bins used are 
determined by considering bursts of star formation at the beginning 
and end of each period (at a fixed metallicity) and set the bound- 
aries such that the fractional difference in the final spectrum is the 
same for each bin. This leads to a set of bins which are almost 
equally spaced in log(lookback time). Nine bins are spaced with a 
ratio of log(lookback time) of 2.07 in this application of MOPED, 
plus a pair of high-redshift bins to improve resolution at 2 > 1. 
This leads to a set of bins whose central ages are 0.0139, 0.0288, 
0.0596, 0.123, 0.256, 0.529, I.Ol, 2.27, 4.70, 8.50 and 12.0 Gyr. 
The gas which forms stars in each time bin is a lso allowed to have 
a metallicity which can vary independently. The lBnizual & CharlotI 
I2OO3) models allow metalficities between 0.02 < {Z/Zq) < 1.5. 
In order to investigate metallicity evolution (Panter et al. 2007, in 
prep) no regularization or other constraint is applied to the metallic- 
ity of the populations - each different age can have whatever metal- 
licity fits best. A further complexity to the parametrization to deal 
with post-merger galaxies which contain gas which has followed 
dramatically different enrichment processes would be to have sev- 
eral populations with the same age but independent metallicities. It 
is possible to consider a more complex parametrization, but again 
one risks degeneracies in solution. With II ages, II metallicities 
and the dust parameter, the model has 23 parameters. The 23 di- 
mensional likelihood surfa ce is explored by a Markov Chain M onte 
Carlo technique outlined in lPanter. Heavens & Jimene3 j20031 Fur- 
ther in formation on the MOPED algorithm is contained in Panted 
( l2005l) . 

2.3.1 Speed issues 

MOPED jHeavens. Jimenez. & Lahavll2000l) works by a massive 
data compression step, forming (in this case) 23 linear combina- 
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tions of the flux data. The resulting MOPED coefficients are fitted 
by standard minimum techniques. The data compression step 
is carefufly designed to give answers which are (in ideal circum- 
stances) as accurate as performing a full fit to the ~ 3852 flux data. 
One of the benefits of the MOPED algorithm is that the number 
of compressed data is determined by the number of model parame- 
ters, not the number of data points. This has the huge advantage that 
analysis of the spectra at 3A resolution is no slower than analysis of 
20A spectra (except for a small increase in overheads such as pre- 
computing the data compression weighting vectors). The algorithm 
takes around two minutes per galaxy on a fast desktop workstation, 
and represents a speed up of around a factor of 170 over a brute- 
force likelihood fit. 



2.4 Computing Ensemble Results 

2.4.1 Common star formation time bins 

MOPED determines the star formation history of each galaxy, rel- 
ative to the lookback time. To obtain the cosmic SFR it is neces- 
sary to shift these results to a common set of time bins, which for 
simplicity are chosen to be the same as those used in the galaxy 
analysis. This allows direct comparison of galaxy star formation 
between galaxies at different redshifts in terms of cosmic time, and 
ensemble conclusions to be drawn. The shifting algorithm ensures 
conservation of star formation. Star formation in the oldest bin is 
always assigned to the final bin with ages no greater than 13.7 Gyr, 
the age of the Universe determined by the concordant WMAP cos- 
mology. 

2.4.2 Inverse Vmax weighting for Fossil studies 

In a magnitude-limited survey such as the SDSS the range of galaxy 
types and sizes included in the survey will vary over the redshift 
range studied. Some mechanism is required to compensate for this 
change and determine the overall bulk parameters for the sample. 
To convert from star formation rates to star formation rate density, 
galaxies are weighted by 1/Knax, where Vmax is the maximum 
volume of the survey in which the galaxy could be observed in the 
SDSS sample. This gives an unbiased estimate of the space density 
/ of any additive property F of the galaxy under investigation, such 
as mass, luminosity, star formation rate. 

/= E (1) 

galaxies z 

On smaller scales the estimator is affected by source clustering, 
but the SDSS is deep enough that these variations should not be 
significant. Any properties which change with redshift and which 
could determine inclusion in the sample must be calculated for each 
galaxy over the redshift range of the survey to determine whether or 
not it would have been included. In order to calculate the Vmax as- 
signed to each galaxy it is necessary to consider the apparent mag- 
nitude and surface brightness evolution over the redshift range. In 
order to compute this we use the same stellar evolution models used 
in the MOPED analysis to calculate luminosity over the lifetime of 
the galaxy due to its recovered star formation history. 

The magnitude of a galaxy over its lifetime depends on the lu- 
minosity behaviour of the various stellar populations that make it 
up - the star formation history. As young stars, the populations will 
have a very high light output, which will reduce as they age. This 
information is encoded in the galaxy spectrum and recovered by 



MOPED, which gives the relative fractions of different aged pop- 
ulations. To compute the observed magnitude if the galaxy were 
to be observed at higher redshifts, we need to evolve the models 
over time and track the changes in luminosity. Obviously, as the 
galaxy is projected to a further redshift the younger fractions do 
not contribute, as galaxy is being 'observed' before these popula- 
tions were bom. Since the spectral energy distribution of the light 
changes with evolution of the galaxy, it is also necessary to apply 
the filters used by Sloan to determine the flux included in the r 
band, leading to a change in Ci,^ in the r-band magnitude as the 
redshift is changed: 

These magnitude corrections can then be used to calculate cor- 
rections which need to be applied to the surface brightness. The 
surface brightness of the galaxy at each redshift z is 

^^i,z = ri,2-|-Ci,2+2.51ogio [n:rlo_i{Dz/ Dz^^^f]+2.^\og^Q 2(3) 

where Dz is the luminosity distance, rgo is the Petrosian half 
light radius and Zobs is the observed redshift of the galaxy. This 
equation assumes that the size of the galaxy does not change over 
the redshift range. Although this assumption is valid for low red- 
shift sources, it will need to be developed if the technique is applied 
to deeper surveys. 

The MOPED technique gives the relative strengths of the dif- 
ferent spectral models. The mass originally created to make these 
masses is then calculated, and by dividing this by the maximum 
volume over which the galaxy could be observed gives the star 
forming density, pi. By adding all the Vmax weighted star form- 
ing densities of galaxies in the sample, rebinned to a common time 
frame, the overall star forming density p can be found for the region 
studied. 

Galaxies may only contribute to the SFR of a time bin if they 
are at a lower redshift than the lower limit of that bin. This con- 
servative approach ensures that the star formation of a galaxy is 
never extrapolated. It also means that in all but a few cases where a 
galaxy is almost on the boundary between bins, the youngest pop- 
ulations, those which are better calculated using instantaneous in- 
dicators from emission lines, do not contribute to estimates of the 
SFR. In addition, to ensure that our results are not biased by single 
erroneous SFH reconstructions we only present results from bins 
which contain contributions from greater than 1000 galaxies. 



3 RESULTS FROM THE DR3 

3.1 Evolution of Stellar Mass Density 

The simplest interpretation of the combined star formation histories 
is a simple Vmax weighted addition that shows how the present day 
stellar mass of the Universe has built up. In Fig.[T]and Table|2]we 
show how the stellar mass density of the universe has changed since 
= 2. In order to compute the remaining mass (when from the 
MOPED algorithm we estimate the original mass of stars formed) 
it is necessary to invoke the recycling fraction, R. Rather than as- 
sume a blanket correction, we have used the detailed predictions 
allowed by the ' Bruzual & Chariot! ( l2003h models for each popu- 
lation of each galaxy, based on their determined ages and metal- 
licities. Although this is slightly more laborious than assuming a 
blanket recycling fraction it gives a more accurate determination of 
present mass, since R is a function of both age and metallicity. 

The abscissa on this plot refers to the minimum redshift of a 
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Figure 1. The build up o f the current stellar mass density of the Universe, as determined from the SDSS DR3 galaxies and the recycling fractions from the 
iBruzual & Charloll i2O03ll models with appropriate Vmax weights (heavy dots, solid line). In all cases statistical erro rs are contained within the dots marking 
the actual data points. Also plotted for comparison are estimates from other surveys: HDF-N.'Pickin sonetalJ i2003l) (red plus, oran ge box denotes their error 
estimation), HDF-S, Rudnick et al. (20()3)(purple stars), HST/CFRS, Brinchmann & Ellis l2000)(blue triangles), K20. lFontana et al . (2004)(green diamonds), 
FORS Deep and GOODS-South Fields, Drory et al. 1 2005)(hollow blue circles). For comparison with simulations we also show the semi-analytical results 
based on the Millennium Simulation of Croton et al. 1 20()6)(dotted line), normalised to our own final stellar inass density. Insert shows exactly the same data 
plotted on a linear redshift axis. No attempt has been made to convert between assumed initial mass functions. 



Table 2. The buildup of stellar density derived from the MOPED/SDSS- 

DR3 fossil record. 



z 


Stellar Mass Density 


Min Density 


Max Density 






logio(M0Mpc-3) 




0.0130 


8.459 


8.401 


8.558 


0.0272 


8.459 


8.401 


8.557 


0.0575 


8.458 


8.401 


8.556 


0.125 


8.454 


8.399 


8.549 


0.286 


8.427 


8.377 


8.516 


0.705 


8.362 


8.317 


8.447 


1.87 


8.222 


8.186 


8.309 



given bin, when all the stars formed in that bin will be in place. 
Simple interpolation between these points is valid as long as the 
star formation rate is constant across the bin. From this it is easy to 
see that approximately 60% of the present day stellar mass was in 
place by 2 = 1, and 90% by z = 0.35. It is also clear that in our 
sample there is overall very little star formation at z < 0.1. 

For comparison we also plot the mass buildup estimated from 
various surveys. We also show the results from se mi-analytical 
mode lling based on the Millennium Simulation jCroton et al.l 
l2006h . The data shows excellent agreement between the largest as- 
tronomical survey and the largest simulation ever undertaken. The 
statistical errors on the points in Fig[T]are so small that they are con- 



tained within the points. This is of course unrealistic, and we have 
also included a more realistic treatment of errors in our ensemble 
estimates by selecting many different (but still reasonable) sample 
criteria and comparing the ensemble results that result from each 
sample. These separate samples vary between minimum redshift 
(0.001, 0.003, 0.006, 0.009, 0.01, 0.03, 0.05), maximum redshift 
(0.2, 0.3, 0.34), bright limiting magnitude (13.5, 14.0, 14.5, 15.0), 
faint limiting magnitude (17.7, 17.77) and surface brightness cuts 
(23.0, 23.5, 24.0). They show the maximum and minimum recov- 
ered quantities from the many ways of selecting source galaxies, 
and should be indicative of the systematic errors from basing the 
analysis on different galaxies, although some of this variation will 
come from real differences between the histories of the galaxies 
selected. The extents of these systematic (and heavily correlated) 
error approximations are not plotted, but are given in Table |2] as 
maximum and minimum densities. In the case of the cumulative 
stellar mass density the overall mass can vary by up to 25%. 

3.2 The cosmic star formation rate 

In Fig. [2] we show the cosmic star formation recovered from SDSS 
DR3 with a Chabrier initial mass function, using the Bruzual & 
Chariot (2003) 3A resolution spectral synthesis models, and a 
single-parameter dust screen following the extinction curve of the 
LMC. 

As seen before in many studies, the steep decline in SFR is 
clearly demonstrated. In contrast to our previous study, we do not 
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Figure 2. The cosmic star formation liistory of tlie Universe, as determined from the SDSS DR3 galaxies (heavy black horizontal bars cover the bins, the lumi- 
nosity weighted age of each bin given by black dots). The vertical error bars are indicative of systematic errors arising from choosing different su bsets of the data 
for analysis. See text for full details. Also plotted for comparison are the estimates from our DRl work (black diamonds) ("He avens et al.l2004l) and those com- 
piled in|Hppkins 1 2004), using the common obscuration discussed within that paper The points have also been corrected for the minor difference in cosmology 
between that work and this paper, and shifted downwards by 0.25 dex to convert from the Salpeter to Chabrier IMF as indicated by the an'ow in the top left. The 
[Hopkins (2004) points are coded as follows: UV indicators (dark blue): Giavalisco et al. (2004); Wilson et al. (2002); Massarotti et al. (2001); Sulli van et alj 
12000); Steidel et al. ( 1999); Cowie et al. ( 1999); Treyer et al. ( 1998); Connolly et al. ( 1997); Lilly et al. ( 1996); Madau et al. ( 1996 ); Oil H-g and H-/3 emis- 
sion (r e d): Teplitz et al. (2003); Gallego et al. (2()02.);,HQgg et al. (1998); Hammer et al.. (1997); Pettini et al . ( 1998); Perez-Gonzalez et alJ<2003l);lTresse et al , 
|2002[) ; iMoorwood et alj i200(]|); iHopkins et al.l j200ol);l sullivan et a l.l 12000^ ; iGlazebrook et^alj il999l) ; | h999l); iTresse & Maddoxl h998l) ; iGallego et al 
1199: 



,, _ IGallem 
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find a peak in SFR at « < 1. This was also found bv lMathis et alj 
( I2006) . in work based on the MOPED fossil analysis approach. If 
there is a peak, it occurs somewhere in our last bin {z > 2). These 
new results from the fossil record are in much better agreement 
with determinations based on contemporary star formation rates. 
Purely statistical error bars are so small to be almost invisible in all 
but the lowest redshift bin. As with the previous figure, the vertical 
error bars in Fig|2|are indicative only. They show the maximum and 
minimum recovered star formation rates from the many ways of se- 
lecting source galaxies. They should be indicative of the systematic 
errors from basing the analysis on different galaxies, although some 
of this variation will come from real differences between the histo- 
ries of the galaxies selected. 



Table 3. The SFR density derived from the M0PED/SDSS-DR3 fossil 
record. 



z Zmin Zmax SFRd Min SFRd MaxSFRd 

(MQyr^^Mpc^^) 



0.081 0.0575 0.125 0.00537 0.00276 0.0117 

0.177 0.125 0.286 0.0166 0.0131 0.0270 

0.416 0.286 0.705 0.0226 0.0187 0.0294 

1.03 0.705 1.87 0.0332 0.0275 0.0388 

3.24 1.87 6.42 0.0993 0.0850 0.113 



Although for completeness we show our results from z ~ 0.1, 
we do not present points with z <C 0.1 for three reasons. First, 
from Fig. [Tj it is clear that there is much less mass in these bins 
on which to form an estimate of the SFR, the change in mass with 
time. Second, to avoid biasing, our Vlnax criteria exclude galax- 
ies from contributing to bins with upper boundary lower than their 
redshift - hence the bulk of the galaxies, at approximately 2 = 0.1, 
can only contribute to bins from z — 0.2 onwards. Third, the galax- 
ies contained in these bins, and the resultant SFR, are strongly de- 
pendent on sample criteria, as expected when sample size drops. 



Later in this paper we analyse a subset of the data to determine 
which changes are responsible for the modifications to the results; 
the main reasons for the changes are the better c alibration of the 
SDSS DR3, the change in IMF to lChabrieJ ( l2003h and the change 
to the higher-resolution Bruzual and Chariot (2003) models. The 
star formation rate density resulting from the MOPED DR3 analy- 
sis is presented in table[3] 
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3.3 The mass function of stellar mass and fit. 

The galaxy stellar mass function of SDSS DR3 is shown in Fig. [3] 
for a range of almost 5 decades in mass (10'^ - 10^^ Mq). The er- 
rors shown are statistical based on our chosen sample criteria. We 
also compute the mass function for the alternative sample criteria 
to develop systematic errors. Over much of this range a Schechter 
fit is good, with parameters (j)* = (2.2 ± O.bstat ± Isys) x 10^^ 
Mpc -^ M* = (1.005 ± 0.004,tot ± 0.200^;,,) x lO" Mq, 
and slope a = —1.222 ± 0.002stat ± O.lsys calculated in the re- 
gion where there are more than 300 galaxies contributing to each 
bin (10**-^ - iqII- SSMo). The mass function is ve ry similar to 
our DRl analysis jPanter. Heavens. & Jimene^l2004l) . but shifted 
to lower masses as a result of the use of the Chabrier IMF rather 
than Salpeter For further discussion of the systematic differences 
in recover ed galaxy mass caused by IMF variation refer to the dis- 
cussion in lBell & de Jond ( 1200 ih . 

The stellar mass function can be used to give a further con- 
straint on the contribution to the density parameter from baryons in 
stars, Sljj*. By integrating the mass over the range of the mass func- 
tion we deduce a value of f2b* = (1.82±0.03stat±0.1si/s) X 10^^ 
(systematic error). This value is in broad agreement with results 
obtained previously when the correction from Salpeter IMF is 
taken into acc ount (Panter, Heavens, & Jimenez 2004; Cole et al. 
,2001; Bell et al. 2003; Fu kugita et a l. 1998; Kochanek et al. 2001; 
Glazebrook et al.. 2003. ; , Persic et al.ll992 ; Salucci et al..l999) . The 



statistical errors reflect the spread of results obtained with varying 
sample criteria as before. 



3.4 Downsizing 

One of the results of i Heavens et al.l ( l2004h was the finding of 
'downsizing' from the SDSS fossil record. Using the new models 
at higher resolution we have found that the evidence for downsizing 
is just as clear. In Fig.|4]we show the cosmic star formation rate for 
galaxies split into different stellar mass ranges. A clear signature of 
'downsizing' is seen: the stars ending up in today's highest-mass 
galaxies formed early, and show negligible recent star formation, 
while the lower-mass galaxies continue with star formation until 
the present day. The lower, non-offset plot can be used to determine 
for a given redshift which galaxies dominate the star formation rate. 



3.5 Investigating spectral residuals 

Due to the power of MOPED and the number of SSPs offered for 
fitting (11) excellent fits can be obtained if the models are accu- 
rate. By comparing the residuals of the best fitting spectrum to the 
raw data on a pixel by pixel basis and then averaging over many 
galaxies in the galaxy restframe it is possible to determine exactly 
which areas are not being accurately fitted for a given spectrum. 
By stacking the residuals of high signal to noise (Sloan SPECOB- 
JALL.SCI_SN, science S/N > 20 per flux measurement) galax- 
ies it is possible to determine to a high degree of accuracy which 
wavelength ranges are failing in the models, and to what extent. If 
we assume that MOPED can, in most cases, obtain the best pos- 
sible fit to a spectrum then the differences must be features not 
included in the models. These features could be things that the 
models are not designed to measure (instrumental effects, inter- 
stellar or intergalactic medium absorption, skyline contamination 
etc.) or alternatively features that are either misrepresented or not 
yet included in stellar modelling codes (e.g. alpha enhancement 



dThomas. Maraston. & Bendei 20031), emission lin es, helium pro- 
duction of dY/dZ ~ 1.5 — 2 I Jimenez et alj2003h and other spec- 
tral features). Fig[5]provides some insight into which features in the 
residuals can be identified. Since we subtract the modelled spec- 
trum from the data, unfitted emission will have a positive effect on 
the residuals while unfitted absorption will be negative. Both ob- 
vious emission lines and filling of absorption features should be 
detectable. The spectra used for this analysis are those which have 
already had the strongest emission line regions removed, as detailed 
earlier - in this case the residual is simply zero. 

The first panel of Fig. [5] shows the mean spectrum. To distin- 
guish between galaxy features and skyline features we select two 
redshift ranges with the same central redshift, z — 0.1. Since the 
averaging of residuals is carried out in the galaxy restframe, in- 
creasing the redshift spread will act to spread any skyline features. 
The third panel gives the residuals for galaxies within 0.001 of the 
central redshift while the second has a range of 0.01. The fourth 
panel shows the second subtracted from the first. In this case, sky- 
line/instrumental will create regions with large amplitude. The rel- 
evant skyline features (and their convolutions with the extremes of 
the particular redshift distribution) are shown and correspond ex- 
actly. It can be seen from the third panel that the majority of the 
spectral range is remarkably clear of skyline contamination - testa- 
ment to the high quality of the Sloan spectroscopy. 

Considering the regions which are not ex cluded by skylines 
we begin to be able to assess the ability of the lBruzual & Charloll 
I2OO3) models to fit the data. It is clear that in all regions where 
weak emission filling could be present and has not been masked 
there is a slight positive tendency in the residuals (although this 
is of course not a failing of the models but a reflection of possible 
systematics affecting our fits), and many of the features in the resid- 
ual correspond to features that may be affected by alpha enhance- 
ment. A more detailed study of individual indices and their alpha 
enhancement, and their resolution in more advanced models includ- 
ing variable abundance ratios will be presented in a later paper. Al- 
though the vast majority of features in the residual can be directly 
related to specific lines there is a strong signal just red of the mag- 
nesium line at restframe 5 176A. Although tempting to attribute this 
to poor fitting of the magnesium feature, it could also be interpreted 
as part of a broader feature between restframe 5200-5900A. It is in- 
teresting to compare the fitting in this region to the area around the 
4OOOA break as both are heavily dependent on metallicity. It is sur- 
prising that the models do so well in fitting the break but cannot 
simultaneously fit this region, and taken with the excess around the 
Calcium Hydride band ( 6830-6900A) suggest that perhaps the bal- 
ance of K-M giants in the models must be improved, the resolution 
of which would lead to a bluer continuum. 

Converting this feature to the observed frame it coincides with 
the dichroic region of the combined red and blue SDSS spectro- 
graphs. If this were to be the cause one would expect that the fea- 
ture would appear in the third panel, which it does not, and certainly 
further work would be necessary to interpret this residual feature in 
the context of the DR3 spectrophotometric calibration pipeline. 



4 THE IMPACT OF MODEL CHOICE 

In this section, we investigate the influence on assumptions on the 
results obtained in the last section. There have been several im- 
provements and changes since our analysis of DRl, and the results 
have changed to some degree. The purpose of this study is to see 
how the assumptions change the conclusions, and to get some idea 
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Figure 3. The galaxy stellar mass function of SDSS. Also shown are those o f lColeetaljj200lh (Orange) and lBell et al] <2003l) (Blue), corrected for differences 
in IMF. The errors quot ed on the parame ters are based upon the statistical errors of the galaxy sample rather than systematics, which are discussed in the text. 
The red solid line is the lSchechte"ilh976l) best fit solution over the quahfying points, the dotted section is an extrapolation over the range covered by the data. 



of the systematic effects introduced by choices of such things as the 
stellar populations used to model the spectra. 

4.1 Sample studied 

Although the MOPED algorithm allows rapid analysis of different 
modelling choices, to investigate a wide range of parameters it is 
necessary to cut the sample down to something more manageable. 
We chose to operate on the main galaxy sample spectra in plates 
0288 and 0444. These two plates form a representative sample of 
808 galaxies from two widely separated patches on the sky. In total 
767 of the 808 satisfy the criteria used in our main analysis and 
would appear in our estimates of SFR. 

4.2 Star Formation Fractions and Star Formation Rates 

There are two places where modelling choices affect the shape of 
the recovered cosmic star formation rate - the estimation of the rel- 
ative fractions of different SSPs and the calculation of a V^max cor- 
rection based on that SFH. For this analysis we wish to investigate 
the two stages independently. To achieve this we need to compare 
both the SFF recovered using different models and the SFR that 
results. 

The galaxies' star formation fractions (SFF) and total mass 
are calculated in their individual rest frames. This SFH is then con- 
verted to a common frame for all galaxies and weighted in propor- 
tion to l/Vmax- Both the original SFF and the 1/Vmax weighting 
are calculated by the models, and the mass is dependent on the IMF. 
Our first comparison (Fig.|6} shows the initial normalized SFF re- 
covered for the galaxies averaged in their restframes. Since it is our 
intention to compare the relative fractions produced by changing 
model parameters rath er than the overall mas s (which has been de- 
scribed elsewhere, see iBell&de Jonltolh ) we present the SFF 
normalized to total star formation of 1 per galaxy . This should 



Table 4. A summary of models used in the analysis. 



Name 


Reference 


FWHM 


SPEED 


Jimenez et al. (2004) 


20A 


PEGASE 


Koc&Rocca-Volmerange (1997) 


20A 


BC93 


Bruzual & Chariot (1993) 


20A 


Mai'aston 


Maraston (2005) 


20A 


GALAXEV 


Bruzual & Chariot (2003) 


3A 



give a clear indication of differences on recovered star formation 
fractions in the models. 

Next we compare the cosmic SFR calculated using these var- 
ious modelling choices. This method allows direct comparison of 
the SFR obtained using each model, but is subject to greater errors 
due to the sensitivity of the Vmax calculation on individual star 
formation histories and the large relative weight change this can 
produce. We present this model-dependent SFR in addition to the 
SFF but caution that the number of galaxies present is insufficient 
to draw robust conclusions. Given the substantially smaller number 
of galaxies, the errors are far larger than those of the complete DR3 
analysis presented earlier in this paper. 

A summary of the models is given in table|4l unless otherwise 
stated a lSalpeteilll955l) IMF is used to calculate the SSPs. 

4.3 Sample galaxies compared to full dataset 

Fig[6ji shows the difference in recovered SFF between our original 
DRl analysis and our subsample. The differences are due to the 
number of galaxies (120,000 in the full DRl vs. 808 in the DRl 
Sample). For all further comparisons in this section the sample of 
galaxies will be 808 identified in the MGS plates 0288 and 0444. 
FigH^ shows the difference in SFR. 
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Figure 4. The star formation rate of galaxies of different masses. Tliese plots sliow tlie contribution to tlie overall star formation rate in the universe from 
galaxies with different masses over the redshift range we consider reliable. In the upper panel the SFR has been offset to enable easier comparison of the 
curves, in the lower there has been no offset applied. It is clear that the more massive systems formed their stars earlier, although no conclusion can be drawn 
as to the number of objects these stars formed in. 



4.4 Pipeline changes DR3 vs DRl, PCA cleaning of skylines 

Fig|6j) shows the impact of changing the pipeline used to reduce 
the original raw data. DR3 contains a more accurate calibration of 
the continuum of the spectrum and a systematic trend that devi- 
ated the continuum blueward of 4000 A has been corrected. The 
SPEED model, at a resolution of 20A was used for all three sets. 
Except for bins 6-9 (numbered from the right), the variation the two 
datasets introduce is very small, at the few percent level. However, 
for these bins the deviations are as much as a factor of three. Note 
that, where present, continuum discrepancies between the blue end 



for DRl and DR3 were on average 10% in the flux (at 3500 A). It is 
remarkable that such drastic changes introduce such small changes 
on the average physical properties of the sample. 

It is reassuring that the changes made to the spectra by PCA 
removal of skyline regions have virtually no effect on our SSP fits. 
It is important to realise though that the majority of the regions 
cleaned by the Wild & Hewett (2005) code are outside of our sam- 
pled wavelength range. 

The effect of the pipeline on the SFR shown in FigjSj) is far 
greater. The mean flux of the spectra between the two releases has 
changed by a factor of between 1.5 and 2, and the masses reflect 



10 



Panter et al. 



ave doto-fit / error 

ove doto-fit / error ove doto-fit / error normalised flux 




Figure 5. The mean spectram and the mean of the residual divided by the error for MOPED fits of DR3 galaxy spectra. The first panel shows the average 
spectrum for the galaxies in with high S/N and z = 0.1 ± 0.01. The second and third panels show the residual at each pixel divided by the error at that 
pixel averaged in the rest frame. The first plot shows the average for 747 high S/N galaxies within the range z = 0.1 ± 0.001, the second for 7406 high S/N 
galaxies within z = 0.1 ± 0.01. The third plot shows the first residual set minus the second, and clearly separates areas contaminated from skylines (which 
are broadened in the second plot by the redshift range). Emission features are labelled in red, absorption blue and skylines green. 
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Figure 6. The systematic changes in recovered SFH for various modelling choices. The labels show the various modelling choices applied to generate the 
SFFs, and are expanded upon in the text. It is important to note that these are the fractions of total mass formed over the lifetime of the galaxy, and not the 
fractions of light contributing to the recovered spectrum. The light from the youngest populations is some 300x brighter than the oldest. The fractions are 
normalised, so changes in overall recovered mass for spectra are not apparent. 
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is important to note that these are the fractions of total mass formed over the lifetime of the galaxy, and not the fractions of light contributing to the recovered 
spectrum. The light from the youngest populations is some 300x brighter than the oldest 



this change. Even ignoring this change in the normalization of the 
SFR it is clear that the trend for a low redshift peak is no longer 
present - the SFR appears to fall monotonically to the present day. 
This interpretation does not exclude a peak at a higher redshift than 
our eldest bin. 



4.5 Stellar population models 

Figs. [6}; and [7] show the compariso n between five differ ent stel- 
lar populatio n synthesis models: the Jimenez et al.l j2004h SPEED 
models, the iFioc & Rocca-Volmerangd ( 1 199"^ PEGASE mod- 
els, th e iMarastonI j2005l) RHB mode ls; the iSruzual & Chariot' 



( Il993h models and the more modern SA lBruzu al & Chariot (2003) 
GALAXEV models rebinned to 20A. The comparison is done at 
20A for the Salpeter IMF and for the one-parameter dust model. It 
is important to establish that this analysis cannot say which model 
set is 'right' , only assist in understanding the differences between 
models. The overall shape of the SEE is in reasonable agreement - 
although there are certainly discrepancies between the populations 
that are recovered. For the very oldest populations the different 
models agree very well. This is not entirely unexpected of course, 
as the stars which contribute to this area of the age-metallicity pa- 
rameter space are well studied and dominate the emission at red 
wavelengths. The different models also predict roughly similar pro- 
portions of the very youngest populations, which rely on similar 
prescriptions for the evolution of blue massive stars and can be 
constrained through the emission at the bluest wavelengths. At in- 



termediate ages the agreement is not so good: this is likely to be 
caused, at least in part, by the difficulty in recovering the fraction 
of intermediate-age stars in stellar pop ulations with declining star 
formation histories (see discussion by iMathis et alj |2006)). This 
tends to produce an artificial step around 1 Gyr in the star formation 
history, except perhaps in the M arastoi] ( I2OO5I) model. The differ- 
ent behavior of this model probably results from the different pre- 
scription for bright Thermally Pulsing Asymptotic Giant Branch 
(TP-AGB) stars. The contribution by these stars to the integrated 
light is still subject to controversy in current population synthesis 
models. Since our spectra are fitted in addition, any poorly fit com- 
ponent will be replaced by another. 

Fig.[8j; shows the difference that the various models make to 
the recovered SFR. It is clear that there is a large spread - as dis- 
cussed earlier, this is due to the fact that the models contribute both 
to the estimation of the SFH and the Vinax weights attributed to the 
galaxies. In all models except those of the PEGASE group the SFR 
decreases monotonically from the oldest to the youngest bins. 



4.6 The impact of resolution, 20A vs 3A 

FigUJi shows the impact of increasing the spectral resolution of the 
data. In this case we use the BC03 at resolutions of 20A and 3A 
on DR3. The differences between the two curves are very small, 
the maximum deviation is only of about 30% in bin 4 and smaller 
in bin 5. For the other bins the agreement is remarkable. What this 
comparison is telling us is that 20A is sufficient resolution to deter- 
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Figure 8. The systematic changes in the overall SFR for various modelling choices. The labels show the various modelling choices applied to generate the 
SFFs, Vmax , mass and SFR and are expanded upon in the text. We plot only those bins used for the analysis in the main section, and caution that due to the 
large Vmax corrections there are insufficient galaxies in this sub-sample to draw robust conclusions 
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mine the average properties of galaxies. Tlie higlier resolution does 
not add much extra information to this. More importantly, the result 
is not biased at the lower resolution. This is not entirely surprising 
since the continuum certainly contains info rmation about both ag e 
and metallicity of a stellar population (e.g. Ijimenez et al.l ( |2004[) ). 
The shape of the SFR recovered in Fig.[8}l is remarkably consistent 
between the two resolutions, with a significant variation in only one 
bin. 



4.7 IMF: Salpeter vs. Chabrier. 

Fig|6^ and [8^ show the impact of changing the initial mass func- 
tion on the recovered SFF and SFR. In this case we have chosen to 
work with the BC03 models at 3A resolution. The IMF determines 
the initial distribution of the number of stars as a function of mass. 
It is therefore not surprising to find changes in the recovered star 
formation history for dramatic changes in the IMF. The Chabrier 
and Salpeter IMF are very similar for masses larger than 1 — 2 M0 
(they are both power laws) while they differ considerably at smaller 
masses: the Salpeter IMF continues being a power law (with index 
— 1.35) while the Chabrier IMF deviates containing a much smaller 
number of small mass stars relative to the Salpeter IMF. As panel 
|6^ shows the main difference occurs at the oldest bin (bin 1), with 
smaller differences in bins 2 — 4 and virtually no differences for the 
youngest bins when the bootstrap error bars are taken into account. 
This effect is propagated through the Vmax and mass calculation 
to the SFR. This is more-or-less what one would expect: for the 
oldest bin, the Chabrier IMF has to compensate its relative lack of 
low mass stars compared with the Salpeter IMF by forming more 
of them. The differences then disappear as more massive stars are 
more dominant at recent ages. It is interesting that although our 
results will be affected by varying the IMF, it will be in a differ- 
ent sense from the results from instantaneous SFRs attained at high 
redshift. Where as virtually all instantaneous indicators estimate 
the total mass of stars from the very high mass UV emitting stars, 
our technique uses the low mass remnants. The fact that the two ap- 
proaches agree suggests that the IMFs currently in vogue are along 
the right lines, and a direct comparison of sufficiently accurate indi- 
cators from both the instantaneous and fossil approach could allow 
IMF fine-tuning. This approach would also require an accurate un- 
derstanding of the dust in star forming regions, a subject of some 
controversy in the literature. This topic will be discussed in a future 
paper. 



4.8 Dust modelling 

One of the most difficult problems in modelling stellar populations 
is how to mod el the attenuation of the population by dust. For our 
previous study l lHeavens et alj2004h we adopted a simplified model 
with only one parameter (the attenuation) while the spectral depen- 
dence of the attenuation was taken to be that of the Large Magel- 
lanic Cloud. Alternate formalisations for the screen are based on 
the Small Mag ellanic Cloud, or es timated for starburst g alaxies by 
ICalzettil ( fT99% . On the other hand, ICharlot&Fa"iil ( l200(]|) have pro- 
posed that a more accurate modelling of the effects of dust atten- 
uation in galaxies can be achieved by a two-parameter model. In 
this model one parameter accounts for dust in the giant molecular 
clouds surrounding young stars (of ages< 10 Myr) and the other 
parameter the dust in the diffuse ISM. The combination of the two 
parameters allows a more complex extinction curve to be generated. 
Unfortunately, since our method does not include the contribution 



to the spectrum from emission lines, it is not possible to determine 
the extinction from the birth cloud from continuum alone. 

Fig[6f and [8f show the differences between the LMC, SMC 
and Calzetti models for the BC03 models at 3A resolution. The 
most significant difference occurs clearly at about 0.1 Gyr. 



5 RESIDUALS OF BEST FIT MODELS 

The average residuals from the best-fit solutions of the different 
models is shown in Fig. |9] This provides a more detailed look at 
how well the models are faring at reproducing the features in the 
observed spectra and if some models do better than others. It can be 
seen that the 20A models cover essentially the same range of spec- 
tral features, and it would be impossible to say that one is better 
than another. The picture changes when we consider the 3A mod- 
els however, they are clearly superior at minimizing the residuals - 
even when rebinned to 20A resolution. Although there are still sev- 
eral regions where features are not quite correct, it is on the level 
of individual lines rather than wide bands of the spectrum. Com- 
paring panels (a) and (b) allows us to investigate the effect of the 
improvement in photometric calibration between DRl and DR2-3 
on line strengths - practically none, as it should be. 

This average deviation should not be confused with average 
goodness of fit however, as inspection of the relevant average 
of the samples shows a slightly different story. The models have 
essentially infinite precision, so there is no penalty associated with 
rebinning. The converse is true for the spectra, as binning pixels 
while correctly propagating the error will reduce the standard devi- 
ation. 



6 CONCLUSIONS 

We have used MOPED to probe the fossil record of star formation 
encoded in the spectra of more than 300, 000 SDSS-DR3 galax- 
ies to determine stellar populations, metallicity evolution and dust 
content. We have also investigated the impact of systematics on 
recovering physical information from the fossil record. Our main 
conclusions are: 

• The main impacts on systematic variation in the estimation 
of SFR from the fossil record are due to the the stellar population 
model, the calibration of the observed spectra and the choice of the 
IMF. 

• We find strong evidence for downsizing, independent of model 
choice. 

• The overall star formation history of galaxies recovered from 
the fossil record agrees well with instantaneous formation measure- 
ments. 

• The mass build-up recovered from our analysis is in good 
agreement with that predicted from both high redshift studies and 
the semi-analytic simulations of galaxy formation based on Millen- 
nium Run. 

• We have identified the cause of many of the residuals from the 
spectral fits. Stellar population models that provide extra freedom 
in terms of alpha enhancement should provide better fits. 

The fossil record continues to provide a useful tool to unveil 
the physical conditions of galaxies in the present and the past. With 
improved models that incorporate alpha-enhancement it should be 
possible to constrain further models of galaxy formation and evo- 
lution and the initial mass function of galaxies. 
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SPEED models; c) D R3 data.lFioc & Rocc a-Volmerang3 119971) PEGASE models; d) DR3 data. lMarastonI fcOOSi) RHB models: e'j iBruzual & Charlojh993h 
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